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Abstract 

In this paper we aim to solve the multiuser multi-input multi-output (MIMO) downlink beamforming 
problem where one multi-antenna base station broadcasts data to many users. Each user is assigned 
multiple data streams and has multiple antennas at its receiver Efficient solutions to the joint transmit- 
receive beamforming and power allocation problem based on iterative methods are proposed. We adopt 
the group maximum signal-to-interference-plus-noise-ratio (SINR) filter bank (GSINR-FB) as our beam- 
former which exploits receiver diversity through cooperation between the data streams of a user The data 
streams for each user are subject to an average SINR constraint, which has many important applications 
in wireless communication systems and serves as a good metric to measure the quality of service (QoS). 
The GSINR-FB also optimizes the average SINR of its output. Based on the GSINR-FB beamformer, we 
find an SINR balancing structure for optimal power allocation which simplifies the complicated power 
allocation problem to a linear one. Simulation results verify the superiority of the proposed algorithms 
over previous works with approximately the same complexity. 

EPICS MSP-MULT 
Index Terms 

Beamforming, linear precoding, MIMO, multiuser, broadcast channel. 



The material in this paper was presented in part at IEEE International Conference on Communications (ICC) 2009, Dresden, 
Germany, and ICC 2008, Beijing, China. 

Yu-Han Yang was with the Graduate Institute of Communication Engineering, National Taiwan University. He is now with the 
Department of Electrical and Computer Engineering, University of Maryland, College Park, USA (e-mail: yhyang@umd.edu). 

Shih-Chun Lin is with the Institute of Communications Engineering, National Tsing Hua University, Hsinchu, Taiwan, 30013 
(e-mail: linsc@mx.nthu.edu. tw). 

Hsuan-Iung Su is with the Department of Electrical Engineering and Graduate Institute of Communication Engineering, 
National Taiwan University, Taipei, Taiwan, 10617 (e-mail: hjsu@cc.ee.ntu.edu.tw). 



DRAFT 



2 



I. Introduction 

In this paper, the joint beamforming and power allocation optimization problem for the multiuser 
multi-input multi-output (MIMO) downlink channel is considered. In this system, transmit and receive 
beamformings are used to suppress the multiuser interference and exploit the multi-antenna diversity. 
Power allocation at the transmitter is performed to efficiently utilize the available transmission power. 
Such a joint beamforming and power allocation problem has been studied by many researchers |[TI-|[5l. 
In HI im, block diagonalization (BD) was proposed to block-diagonalize the overall channel so that 
the multiuser interference at each receiver is thoroughly eliminated. Such a zero-forcing approach suffers 
from the noise enhancement problem, because it removes the multiuser interference by ignoring the noise. 
Hence the performance can be improved if the balance between multiuser interference suppression and 
noise enhancement can be found lH lH. 

Under individual signal-to-interference -plus-noise-ratio (SINR) constraints for users, Schubert and 
Boche studied the situation where each user has only one data stream and single receive antenna H. It 
was shown that the optimal solution can be efficiently found by iterative algorithms. Khachan et al. 151 
generalized the scheme in 01 to allow several transmission beams to be grouped to serve a user, and 
each user has multiple receiver antennas HI. However, each data stream is processed separately. Thus, in 
addition to the multiuser interference from the other users, there is intra-group interference between the 
data streams of a user. This drawback motivates our work to use a more sophisticated receiver processing 
to tackle the intra-group interference. 

In this work, we adopt the group maximum SINR filter bank (GSINR-FB) proposed by IS as the 
beamformer, which collects the desired signal energy in the streams of each user and maximize the 
total SINR at its output. That is, the GSINR-FB lets these streams cooperate while the filters in lH 
let them compete. Based on the GSINR-FB beamformer, we consider a system which uses the average 
SINR over data streams for a user as a metric to measure the quality-of-service (QoS). This criterion is 
very useful in many communication scenarios IH-lIll including the celebrated space-time block coded 
systems. It will be shown that the GSINR-FB based beamformer does improve the performance over 
the scheme in Q. Moreover, we find that the SINR balancing structure exists for this beamforming 
method, that is, the optimal power allocation results in the same SINR to target ratio for all users with 
the GSINR-FB based beamforming. As will be shown later, this property makes solving the complicated 
power allocation problem much easier. Our work can be seen as a non-trivial generalization of lH to the 
multi-antenna setting which also subsumes Q as a special case (with independent processing of data 
streams). For simplicity, we will first consider group power allocation which restricts equal power on 
the data streams of each user to benefits from the low-complexity power allocation schemes similar to 
those in in in . This restriction is later relaxed by allowing the power of individual data streams to be 
adjustable. Besides the GSINR-FB based beamforming, this per stream power allocation scheme is new 
compared with HI Q and has better performance than the group power allocation. These two techniques 
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are the key ingredients to make our performance better than that in Q. With approximately the same 
complexity as |[5||, our approach exhibits a better performance compared to the existing methods in jSl 
and the BD based methods. 

We will investigate two optimization problems. One is minimizing the total transmitted power while 
satisfying a set of average SINR targets. The other is maximizing the achieved average SINR to target 
ratio under a total power constraint. Based on the uplink-downlink duality lO, our methods iteratively 
calculate the GSINR-FB based beamforming and power allocation matrices. The rest of the paper is 
organized as follows. The system model and problem formulation are introduced in Section |lll We also 
briefly discuss the basic design concept of our iterative algorithms in this section. Backgrounds such as 
the GSINR-FB based beamformers and the applications of the average SINR criterion are provided in 
Section [III] Section UV] presents our power allocation results. The numerical results are given in Section 
rvl and the computational complexity issues are discussed in Section IVH Finally, we give the conclusion 

inrvm 

II. Problem Formulation and Efficient Iterative Solutions 

A. Notations 

In this paper, vectors and matrices are denoted in bold-face lower and upper cases, respectively. For 
vector g, g >e means that every element of g is nonnegative. For matrix G, trace(G) denotes the trace; 

and G^ denote the transpose and Hermitian operations, respectively. ||-||^ denotes the Frobenius 
norm, which is defined as ||G||^ = y^trace (GG^). G^^ and \Gs\ are, respectively, the inverse and 
determinant of a square matrix G^. And I„ denotes the identity matrix of dimension n. A diagonal 
matrix is denoted diag{. . .} whose kth parameter is the kth diagonal term in the matrix. E[-] denotes 
the expectation operator. 

B. System Model 

Consider the downlink scenario with K users, where a base station is equipped with M antennas. The 
upper part of Fig. [T] shows the overall system block diagram for user k, who has receive antennas and 
receives data streams, where satisfies the constraint Lj^ < min{A/, A^^} to make sure effective 
recovery of the data streams at the receiver. Thus the K users have a total of = XlfcLi A^fc receive 
antennas receiving a total of L = grouped data streams. For a given symbol time, the data 

streams intended for user k are denoted by a vector of symbols x^, = [xki,Xk2, ■•■,XkLkV- The L data 
streams are concatenated in a vector x = [xf , ...,x^]^. Without loss of generality, we assume that x 
is zero mean with covariance matrix I^. The precoder Ufe G QMxLk processes user A;'s data streams 
before they are transmitted over the M antennas. These individual precoders together form the M x L 
global transmitter beamforming matrix U = [Ui, U2, U^^]. The power allocation matrix for user k is 
a diagonal matrix 

Pfc = diag{pki,pk2, ...,PkLj, (1) 
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where pkj is the power allocated to the jth data stream of user k, and the global power allocation matrix 

P = dm(7{Pi,P2,...,Pi^} (2) 



is a block diagonal matrix of dimension L x L. The transmitter broadcasts signals Uy Px to all of the 
K users. 

User k receives a length A*";; vector = H|^U\/Px, which can be expanded as 

yfc = Hf Ufc yP^Xfc + ( J2 yP^x,- I + rifc , (3) 

where the channel between the transmitter and user k is represented by the A^^ x M matrix H^, the 
Hermitian of H^; represents the zero-mean additive white Gaussian noise (AWGN) at user k's receive 
antennas with variance cr^ per antenna and the covariance matrix E[nkn^] = c^^Nk ■ The resulting NxM 
global channel matrix is H^, with H = [Hi,H2, ...,Ha']. We assume that the transmitter has perfect 
knowledge of the channel matrix H, and receiver k knows its perfectly. The second term on the 
right-hand-side of ^ is the inter-group multiple user interference for user k. To estimate its symbols 
Xfc, user k processes with its x receive beamforming matrix V^^^. The resulting estimated signal 
vector is 

xfc = vMuVPx + Vfrifc 

= VfH|^UfcV^Xfc + VfHf [ U.VP^x, ) +Vfnfc. (4) 

Without loss of generality, as ||6l, we assume that the interference-plus-noise components of the filter 
bank output in (01) are uncorrected. For any filter bank that produces correlated components, one can 
easily find another filter bank which makes these component uncorrelated but with the same performance. 
The details can be found in fSJ. 

Finally, owing to the non-cooperative nature between users in broadcast channels, the global receiver 
beamforming filter V^, formed by collecting the individual receiver filters, is a block diagonal matrix 
of dimension L x N where V = diag{[Vi,Y2, ...,Yk]}- 

C. Problem Formulation 



In this paper, we consider the average SINR of user k over all its L^. data streams SINR^ = 
X]j'=i SINRfcj/Lfc as the performance measure, where SINR^j is the SINR of the jth data stream of 
user k. The importance and applications of this design criterion will be reviewed in detail later in Section 
IIII-B I Based on the average SINR constraints and system model described in Section III-BI we consider 
two problems as follows. The first optimization problem, which will be referred to as Problem Pr in the 
following sections is 
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Problem Pr: Given a total power constraint Pmax and the SINR target 7^ for user k, maximize min SINRfc/7fc 

k 

over all beamformers U, V, and power allocation matrix P, that is, 

max min ^^^^^ subj. to V'V'pfci < -Pmax- (5) 

U,V,P k 71. ^-^ ^ 

fe=lj=l 



We call SINRfc/7fc the SINR to target ratio for user k. 

If the minimum SINR to target ratio in Equation ^ can be made greater than or equal to one, then 
the second optimization problem is to find the minimum power required such that the SINR targets can 
be all satisfied. The mathematical formulation of this problem, which will be referred to as Problem Pp 
in the following sections is 

Problem Pp: Given a constraint on the minimum SINR to target ratio, minimize the total transmitted 
power over all beamformers U, V, and power allocation matrix P as 

K Lfc SINRfc ^ 
min V'pfc,- subj. to min > 1 and V'r-j < -Pmax- (6) 

U,V,P ^-^ ^-^ ■' k -ih 

k=l j=l k=l j=l 

D. Iterative methods based on uplink- downlink duality 

We briefly review the uplink-downlink duality, which plays an important role in finding efficient 
solutions based on iterative methods for our problems. In ll9l- l[T2l . it was shown that it is always possible 
to find a virtual uplink system for the downlink system. We plot the virtual uplink for user k in the lower 
part of Fig. [TJ where is the corresponding power allocation matrix in the virtual uplink defined 
similarly as P^. To be more specific, with fixed beamforming filters U, V, SINR targets 71, . . . 
and the same sum power constraint Pmax for both the downlink and the virtual uplink, the downlink and 
its virtual uplink system have the same SINR to target ratio with optimal P and Q. 

With the aids of the uplink-downlink duality, the optimization problems Pr and Pp in Section III-CI can 
be solved efficiently with iterative algorithms. Now we introduce the basic concepts of these algorithms, 
as summarized in Table Jl For simplicity, we use Problem Pr as an example. From Table Jl for iteration 
n, with the downlink transmitter and receiver beamformers U*-"^ and V^") fixed, we can obtain a new 
power allocation matrix p{2"+i) to increase the minimum SINR to target ratio min SINR^./7fc- Note that 

k 

the downlink power allocation are executed two times (Step 1 and 3) for the nth iteration, as shown in 
Table in To simplify notations in the following sections, we use p(2"+i) p(2n+2) represent the new 
power allocation matrices for the first and second downlink power allocations respectively. With fixed 
p(2n+i) ^j^^ u("\ we can obtain a new downlink receiver beamformer V'^"^"^^ to increase SINRfc/7fc for 



all users. The minimum ratio minSINR/;/7fc is further optimized using the new power allocation matrix 

k 

p(2n+2) computed from U'^") and V'^""''^). Then we turn to the virtual uplink to update U^'^^. Similarly, 
fixing uplink transmitter beamformer V^*^"^^) and receiver beamformer U("\ we obtain a new uplink 
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power allocation matrix Q(2"+^). After power allocation, the SINR to target ratios of the downlink and 
virtual uplink are equal. Then we can find U("+^) based on Q(2"+i) and V("+^). After that, Q(2"+2) is 
updated according to the new U^""*"^) and v("+^\ and so on. 

Note that all the iterations are done at the transmitter, and the transmitter does not need to feed 
forward the optimized receive filters to the receivers during the iterations. The receiver can compute the 
final filter by itself after the iterative algorithm stops. This procedure is the same as |[T3l Section II-B], 
and we briefly describe it here. First, as in the "common training" phase in |[T3l |[T4l . each receiver k 
can estimate its own channel by using the known training sequence. After receiver k feeds back 
Hfe to the transmitter, the transmitter can iteratively compute transmit and receive beamforming filters, 
as well as power allocation matrices in Table U according to H^.. After the iterative algorithm stops, 
the "dedicated training" phase as in llT3l is performed to let the receivers compute the final receiver 
filter. In this phase, the transmitter will broadcast orthogonal training sequences to the receivers as in 
|[T3l . and each receiver can estimate the final equivalent channel formed by H^, the transmit filters, and 
power allocation matrices to calculate its final receive beamformer. We will first show how to calculate 
the beamforming filters in the next section, and then show how to use these filters to determine power 
allocation in Sections IIV-BI and IIV-CI 

III. Group Maximum SINR Filter Bank for the Average SINR Constraint 

In this section, we introduce the key motivation of our paper, that is, the use of GSINR-FB in ||6l as 
the beamfomer to solve ([5]) (O. This filter bank is a non-trivial generalization of the one used in lH. 
It uses the dimensions provided by the multiple receive antennas at each user more efficiently than lH. 
Specifically, the streams of each user (or group) cooperate with one another in our scheme, rather than 
interfere with one another as in [5j. Since this filter bank maximizes the total SINR of the streams of 
each user, it also maximizes the average SINR criterion adopted in this paper. We will also review the 
applications of the average SINR criterion at the end of this section. 

A. Group Maximum SINR Filter Bank 

To solve (O (O, the GSINR-FB is adopted for our transmitter beamformer U and receiver beamformer 
V to maximize the average SINR. Moreover, as will be shown in Proposition [H the optimal SINR 
balancing structure based on the GSINR-FB beamforming wiU make the corresponding power allocation 
problem trackable. Let us first focus on Step 2 in Table H that is, given U(") and p(2"+i), finding filter 

Lk 

V("+i) to maximize Yl SINR^/^, VA; (Lk times of the average SINR), where SINR};^/^ is the SINR of the 



jth stream of user k in this step. For brevity, we shall omit the iteration index n in most of the following 
equations. Following 161, the optimization problem becomes 




(7) 
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with 



where = [v^i, . . . , v^lJ, while 

RD^; = Hf UfcPfcU^Hfc and R^^ = ^ U,P,Uf + a^I^, , (8) 

are the signal covariance matrix and the interference-plus-noise covariance matrix for user k, respectively. 
It is now evident that we must let < min {M, iVfc} since the number of eigenvectors is limited by the 
dimension of H^. The optimization problem in ([7]) was shown to be equivalent to solving the generalized 
eigenvalue problems ^ as 

R.°fc vfcj = Al?,^R°>fc„ Vi (9) 

H-TfDL 

Then can be computed easily. The receive beamforming filter designed for the downlink can be 
carried over to the transmit beamforming filter for uplink, and vice versa. Thus the receive beamforming 
filter U("+^) for the virtual uplink system in Step 4 in Table U can be computed similarly. 

Now we show why the GSNIR-FB performs better than those in lH iQ. In ||51, all streams interfere 
with one another and v^^ satisfies 

"'s,kj^kj — '^M,kj^n,kj^kj, 

where X^}"j^j is the maximum generalized eigenvalue of (R^^^, Rj^^p; 

Tis,kj = Hf Ufc.ugHfc and R^^^- = ^ pnH^^Ufc.uf.Hfc + ^ U,P,Uf + a^lM, (11) 

are the signal covariance matrix and the interference-plus-noise covariance matrix for stream j of user 
k, respectively, and JJ^ = [uy^i, . . . , Uj^Lfc]- Comparing (fTTl) with ((S]), one can easily see that, in Q, 
the streams of the same user interfere with one another and there is additional intra-group interference 
in B,^\j (the first term of compared with in ([H). The GSINR-FB beamforming exploits 

additional dimensions from the multiple receiver antennas, which are not provided in Q (where N^-l), 
much more efficiently, by letting the streams of each user cooperate rather than compete as in Q. 

B. Average SINK criterion and its applications 

The average SINR criterion SINR^. is very useful in many communication systems m-lH and can 
serve as a good metric for the QoS. Here we briefly review some of its applications. Note that in these 



applications, it is the total SINR LfeSINR^ which serves as the performance metric, which equals to Lk 
times the average SINR. However, as will be discussed in Section |Vl to have a fair comparison with the 
results in Q where the per stream SINR is considered, the average SINR is used in the comparison. 
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Approximation of maximum achievable rate at low SINR ||3: The maximum achievable rate for user 



loe 



«log(l + LfcSlNRfc/r), 

where F is the SNR gap to capacity ifTSl P.432] |[T6l Chapter 7] due to suboptimal channel coding schemes 
and the limitation of circuit implementation in practical systems. According to |[T5l P.432], the gap is 
huge (8.8 dB) for uncoded PAM or QAM operating at 10"^ bit error rate. This approximation is also 
useful in systems with large numbers of users where the total interference power in ^ is large. 

Receiver SINR lUl lIU: Assuming that the maximum ratio combining (MRC) is applied to in (01), 



the receiver SINR at the output of the MRC is the sum of individual SINRs as L^SINRfc. This metric 
is very useful when space-time coding is applied and contains the space-time coded symbols. In this 
case, the decoding is based on the MRC results IQ. 

Minimization of the pairwise error probability Q: When a space-time block code (STBC) is appUed 
and Xfc contains the STBC symbols. Assuming that the channel is slow fading and remains constant 
during the transmission of a codeword, and that the maximum-likelihood detector is used at the receiver, 
one can approximately transform the minimization of the pairwise codeword error probability to the 



maximization of L^SINR^ following the steps in 17|. This approximation applies to both the orthogonal 
and quasi-orthogonal STBCs. 

IV. Power Allocation 

Now we focus on the optimal power allocation strategy for the Step 3 in Table Jl where the maximum 
SINR beamforming filter banks U^"^ v^""*"^) and a set of SINR targets 71,..., 7^ are given. The 
optimization problem corresponding to Problem Pr Q is 

. SlNR^ , . 

maxmm subj. to } } pkj < -Pmax- (13) 

p fc 71, ^-^ ^-^ 

k=i j=i 

The other one corresponding to Problem Pp Q which minimizes the total transmitted power, such that 
each individual SINR target can be achieved, is 

K Lk gjj^p^ DL K Lfc 

min^^Pfcj subj. to min ^ > 1, and ^ ^Pfcj < Pmax- (14) 

k=l j=l ^'^ k=l j=l 

We will first explore the structure of the optimal solutions for these problems in Section ITV-AI However, 
even with this structure which significantly simplifies the problems, the two per-steam power allocation 
problems are very complicated and the solutions in ||4l ||5l do not apply. Thus, we first intensionally 
introduce some restrictions to the power allocation strategies to simplify the problems and benefit from 
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the simple power allocation schemes similar to those in lH 121. In Section |Vl the simulation results show 
that even without the new per stream power allocation, the performance of lH 151 can be enhanced by 
simply applying the GSINR-FB as the beamformers. This verifies our motivation to use the GSINR-FB. 
The results for the simple "grouped" power allocation are presented in Section IIV-BI We then remove 
the restrictions and present the general per-stream power allocation results in Section IIV-CI The insights 
to why the proposed algorithms perform better than those in ||5l are given in Section IIV-DI 

A. Optimal SINR balancing structure under GSINR-FB beamforming 

r-DL 



By carefully rearranging the complicated SINR;, to a simpler equivalent form and using the properties 
of the GSINR-FB, we prove the following structure for the optimal power allocation which makes solving 
the complicated power allocation problems (fT3l) (fT4)) possible. 

Proposition 1: For the optimization problem (fT3] ). the optimal solution P makes all users achieve the 
same SINR to target ratio, that is, SINRfc^/7fc = C°^, for all k. Here is the SINR balanced level. 

Proof: The vector norms of the beamforming filters v^j, j = l...Lfc, can be adjusted such that 

1) Vf R°^Vfc is a scaled identity matrix 0, 

2) trace (Vf V^) = L^. 

When the above two conditions are satisfied, the average SINR of user k in the downlink scenario can 
be expressed as 

rDL i , 



1 ^ trace V, 



SINRfc = — V SINESI = ) ■ f . (15) 

^Stt trace VfR-V, 



Expanding Hs,k and R„,,fc, 



,DL _ trace (Vf H^^UfePfeUf HfcVfc) 



sinr;: = ^^-^ — ^ — " — . (i6) 

E trace Vf Hf U^PjUf HfeVj + L/^a^ 

Since trace (XY) = trace (YX) ITTl . the trace(-) terms can be written as 

trace(VfH|^U,PjUfHfcVfc) 
= trace(P,Uf HfcVfcVf Hf U,) ^^^^ 

Lj 

= Y^Pjii-^jkh, 
1=1 

where Ajk = Uj^HfcVfcV^H^^Uj and [Ajk]u denotes the Ith diagonal element of Ajk- Therefore, the 
average SINR of user k is 

Yl Pkl[Akk]ll 

SlNRfc^ = • (18) 

j=i,j^kl=i 
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Observing (fTSl ). we know that the maximizer of the optimization problem (fT3l) satisfies 



rDL 



^^^^ = CDL, 1 < /fc < (19) 

lk 

The reason is as the following. Since [Ajk]ii > 0, yj,k,l, each SINR^^ is strictly monotonically 
increasing in p^i and monotonically decreasing in pji for j / A;. Thus all users must have the same SINR 
to target ratio C^^. Otherwise, the users with higher SINR to target ratios can give some of their power 
to the user with the lowest ratio to increase it, which contradicts the optimality. 

■ 

Following the same steps of the above proof, the SINR balancing structure also exists for Problem Pp 
in (fT4l) . Now we can solve power allocation problems ([T3] ) and (fT4l) with the aid of Proposition [U which 
makes these problem trackable as shown in the following. 

B. Simplified Solution: Group Power Allocation 

For clarity, we present the simple group power allocation first then the general per-stream power 
allocation in the next subsection. The group power allocation intentionally restricts the power allocation 
strategy to make the complicated power allocation problem with multiple receiver antennas similar to the 
simple one in ifTSl llH where A'^^ = 1. Thus the group power allocation takes the advantage of the spatial 
diversity provided by the GSINR-FB based beamforming to improve the performance, while keeping the 
complexity moderate. 

To be more specific, the allocated power for a user using the group power allocation is evenly distributed 
over all streams of that user as 

Pki = Pk2 = ■■■ = PkLk, l<k<K. (20) 

Let the power allocated to user khe, p^. Consequently, the diagonal power allocation matrix for user 
k can be written as a scaled identity matrix, that is, 

Pfc = 7^lL.. (21) 

^k 

We also define a vector p = [pi, . . . ^PkV to replace matrix P in the optimization problems. Substituting 
Pfc = f^lLfc into Equation ( fTSl ). the average SINR in problems ( [T3] ) and (fT4l ) is 



rDL %\\^k^k^k\ 



F 



SINRfc = ^ ^ . (22) 

Eil|VfHfU,||^ + a2 

j¥=k 

With the "grouped" constraint on the power allocation strategy (|2T]) . the simplified average SINR (l22l) for 
Nk > I has the same structure as that in ifTSl HI where Nk = 1. Thus the solutions of this simplified group 
power allocation for Problems Pr and Pp can be easily obtained. These solutions are briefly presented in 
the following subsections. The overall optimization algorithms are also summarized at the end of each 
subsection. 
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Group Power Allocation for Problem Pr: With the SINR balancing structure from the GSINR-FB 
beamforming in Proposition [U the group power allocation for Problem Pr ( fT3] ) can be solved by a simple 
eigensystem as 

TTP = ^P, (23) 
where the extended coupling matrix T and the extended power vector p are defined as 



and 



P 

1 



(24) 



respectively, where 

|vfHfUi||^'-' ||vMUk||^ 



D = d^ag { „__^r.'JL . ....ZtH „2 (25) 



||V""H-^U ■ IP 

and the ijth element of the K x K matrix ^ is zero when j = i or - — ' ^ '^^ when j ^ i. 

By using Proposition U and the simplified average SINR (l22l) in ([T3] ). the rest of the proof of the 
previous results is similar to those in ifTSl ||4l and omitted. With ^ and (1231 ). we summarize the final 
optimization algorithm for Problem Pr in Table JIJ which iteratively calculates the optimal beamforming 
filter and power allocation vector between the downlink and the uplink, where eig means the generalized 
eigenvalue solver. Due to the uplink-downlink duality described in Section III-DI it is guaranteed that the 
uplink balanced level C^^ equals to the downlink balanced level C^^. 

Group Power Allocation for Problem Pp: Again, with Proposition 1, the minimizer of (fT4l) satisfies 

rDL 



SINRfc = 7fc, l<k<K. (26) 

Substituting (|26l ) into (l22l ). the resulting power allocation vector is 

p = (I - D*)-^Dcr. (27) 

The optimal q for the virtual uplink can be obtained similarly. The overall algorithm for Problem Pp is 
summarized in Table JII] which iteratively finds the optimal solution minimizing the required power. 

Note that (|27] ) does not necessarily have a solution with nonnegative elements. When there exists at 
least one nonnegative power allocation satisfying the target SINR constraints and total power constraint 
^max in (fT4l ). we call the system feasible. Depending on the channel conditions, the total power required 
to achieve the target SINRs could be quite large and exceed Pmax- For the purpose of studying the effects 
of the algorithms on the system feasibility, we use the sum power allocation algorithm in Table JI] with 
a large Pmax (43 dBm) to check the feasibility as in HI, iQ. In checking the feasibility, as soon as 
the balanced level becomes larger than 1 (which means that a feasible solution can be obtained), the 
algorithm switches to the power minimization steps. On the other hand, if the balanced level remains 
below 1 when the feasibility testing stage ends, the feasibility test fails and the power minimization 
algorithm stops. In practical applications, when the system is infeasible, one must relax the constraints 
by reducing the number of users K or decreasing the target SINR. 
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C. General Solution - Per Stream Power Allocation 

Now we remove the restriction of evenly distributing power in a group in Section lTV-BI The performance 
is expected to be further improved since the group power allocation is a subset of the per stream power 
allocation. The general power allocation solutions presented in this subsection are much more complicated 
than the results in HI 151. The overall optimization algorithms for Problems Pp and Pr are also summarized 
at the end of each subsection. 

Per Stream Power Allocation for Problem Pp: The power minimization problem using the result of 
Proposition [T] becomes 

"^^"^^^Pkj S.t. ^^Pkj < -Pmax and SINRfc^ = 7k,l<k< K. (28) 



With the equivalent SINR expression in (1181 ). we will show that (1281 ) can be elegantly recast as a well- 
known linear-programming problem. We first recall that the average SINR of user k (fTSl) is 

Yl Pkl [Afcfc]zz 
i-DL 1=1 



SINRr = . (29) 

j=ijVfci=i 

Substituting (|29l ) into (|28] ). the original power minimization problem turns into a linear programming 
problem, that is, 

mill 

S.t. Y.Pkl[Akk]llhk- Y. Y Pjl[Ajk]ll = Lkcr'^ (30) 

1=1 j=l,jJLkl=l 

for k = 1, ...,K, and p >e 0, 
where p represents the vector comprising the diagonal elements of P as in Section IIV-BI It is known 
that a linear programming problem can be solved in polynomial time using, for example, the ellipsoid 
method or the interior point method |[T9ll . 

Table |IV] summarizes the proposed iterative algorithm with group maximum SINR beamforming and 
per stream power allocation. The virtual uplink power allocation problem can be similarly solved as 
(l30l ) with Ajk replaced by Bj^ = Yj^n^JJkV^UjYj. Like the group power minimization algorithm 
in Table |llll the feasibility of this algorithm should also be checked using the per stream sum power 
allocation which is described in the next subsection. 

Per Stream Power Allocation for Problem Pr : With a fixed beamforming matrix U, a fixed receive 
filter V, and a total power constraint, the optimization problem obtained by applying Proposition [U in 
(US) is 

max C^^ 
p 

>DL 



7fc ' (31) 



K L 

and Y Y Pkj = Pi 
k=ij=i 
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rDL 



where SINRj^ is rearranged in form 

The optimal power allocation vector for this complicated problem is difficult to obtain, thus we consider 
a suboptimal solution which can be found by simple iterative algorithms. First, using the concept of 
waterfilling, we fix the proportion of the power of data streams in each group according to the equivalent 
channel gains. That is, let 

Pki ■ Pk2 PkLk = [Afcfcjii : [Afcfc]22 : . . . : [Afcfc]^^^^, 

for k = l,...,K. 

Therefore, the variables pki PkLk be reduced to one variable tf^ such that pi^i = tk [A^klu I E \^kk\ 

i=\ 

for each I, and E Pu = ^k- The SINR for user k in Equation (|29l ) can be rewritten as 

tk{Y.\^kk\llY.\^kk\, 

SlNR^"^ = . ^ \ (33) 

,1=1 1=1 / 




tkdkk 



E ij9jk + Lkcr"^ ' 



(34) 



K K Lk 

with E = E E Pki = -Pmax- 

fc=l k=ll=l 

We then solve problem (|31| ) with concepts similar to the sum power iterative water-filling algorithm 



proposed in EOl . The nth iteration of the algorithm is described in the following. Note that this problem 
has a similar form as ([73] ). thus the balanced levels, defined as SINRfc/7/fc, of all users must be equal 
according to Proposition [U At each iteration step, we generate a new effective level gain for each user 
based on the power of other users from the previous step t°, j ^ k as 

^ dkk/lk /ic\ 

E tj9ik + LkCF^ 

for k = 1,...,K. The K power variables t^s are simultaneously updated subject to a sum power 
constraint. In order to maintain an equal level, we allocate the new power proportionally to the inverse 
of the level gain of each user as 

p 

, ^ max 

tk = ^ ■ (36) 

^kE 

Note that when updating t^, the power variables of other users are treated as constants and > 0. 

Similarly, for the virtual uplink, we denote the power variable for user k as Sk and the effective level 
gain for user k as Hf^. The proposed algorithm for the overall problem Pr is summarized in Table jV] 
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D. Insights to the performance advantage of the proposed approaches 

The insights to why the proposed approaches outperform those in lH llH are discussed as follows. First, 
under the same power allocation matrix P in (l5]l and the GSINR-FB will perform better than the 
beamformers in 151. This is because the streams of each user cooperate with one another in our scheme 
rather than interfere with one another as in Q. The mathematical validation was given in Section ITlI-AI 
Indeed, as shown in ||6l Section III], the GSINR-FB includes the minimum mean-squared error (MMSE) 
filter used in ||4l Q as a special case (without cooperation). Thus the GSINR-FB should have a better 
performance. As for the power allocation part, note that our sub-optimal group power allocation has a 
formulation similar to that of the power allocation methods in IH Q. Thus they should be similar in terms 
of optimality. Our more complicated per-stream power allocation includes the group power allocation as 
special case. Therefore it should perform better than the group power allocation and the power allocation 
methods in El S. 

Finally, we note that we have no proof whether our iterative algorithms converge to the global optimum 
or merely local optima. However, as shown by the simulation in the next section, the local optima still 
result in much better performance than @ 121. 

V. Simulation Results 

In this section we provide some numerical results to illustrate the advantages of the proposed algorithms 
over and the simple BD methods 0. The design concept of the BD transmit beamformer is to remove 
the inter-user interference in ([3]) completely. A BD beamformer can be found when Af > X]£i i^k ^i^k. 
To solve Problems Pr and Pp in dSj and ([6]), respectively, and to maximize the average SINR of the worst 
user, we also apply the GSINR-FB as the receive beamformers for the BD cases. Note that this paper 
focuses on the QoS of individual users, where the average SINR serves as a metric of QoS. For the BD 
cases, the conventional BD receive beamformer design is more for the purpose of sum rate maximization 
(with waterfilling power allocation), which usually does not maximize the SINR of the worst user. Thanks 
to Proposition [T] the corresponding power allocations can be derived similarly to those in Section ITV-B I and 
the details are omitted here. We also consider both the group and per stream power allocation strategies 
for BD, named "group BD" and "per stream BD", respectively. 

For the system simulation parameters, the channel matrix is assumed flat Rayleigh faded with 
independent and identically distributed (i.i.d.) complex Gaussian elements with zero mean and unit 
variance. The noise is white Gaussian with variance 1 W. The transmitter is assumed to have perfect 
knowledge of the channel matrix H^, and each user knows its own equivalent channels as discussed 
in Section III-DI Since typically the transmitter has more antennas than the receivers, we set the number 
of streams equal to the number of receive antennas for user k. Without loss of generality, we 
assume a common SINR constraint 7 for all users, i.e., 7^ = 7 for all k. In the following simulation, 
we generate 1000 channel realizations and average the performance. The convergence criterion e of the 
iterative algorithms is set to 10"'^. 
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Fig. |2] shows the simulation results of the balanced level C^^ versus total power Pmax for Problem 
Pr, where C^^ is defined as in Proposition [T] The two proposed algorithms in Table |ll] and Table |V] 
are compared with the method proposed in and BD. Note that in fSj, the data streams are processed 
separately and the balanced levels are the same for all streams. With a common SINR constraint 7 to 
be satisfied by all streams, the per-stream balanced level defined in [51 gives the same value as the 
balanced level C^^ defined in Proposition [T] So the comparison of C^^ is fair in Fig. |2l The simulation 
parameters are: K = A users, M = 8 transmit antennas, each user has 2 receive antennas and 2 streams 
{Nk = Lk = 2, \/k), and the SINR constraint 7 = 1. For each channel realization, all the three algorithms 
run until convergence but for at most 50 iterations. For fair comparison, only the cases where all the 
three algorithms have converged within 50 iterations are considered in averaging the performance. We 
will discuss the convergence probabilities later. It can be seen that the proposed group power allocation 
achieves higher balanced levels than the method in IJl at the positive SINR region. The proposed per 
stream power allocation further outperforms group power allocation. Similarly, the per stream BD achieves 
higher balanced levels than the group BD since the group BD is a special case of the per stream BD. 
Note that the BD schemes perform better when the total available power Pmax is high and perform 
worse when the available power is low, since BD is a zero-forcing method which suffers from the noise 
enhancement problem at low Pmax- When extremely large power is available, BD will perform close to 
the proposed methods. However, the operating region where this phenomenon is obvious needs a much 
higher power than our setting in Fig. |2l We do not show the simulation results in this region since it is 
less practical. 

In Fig. [3l we plot the minimum total required power Pmin versus SINR constraint 7 for Problem Pp. 
Simulation parameters are K = 2 users, M = 8 transmit antennas, and each user has A^^ = 4 receive 
antennas and = 4 streams. Again, for the method in Q, a common SINR target 7 has to be achieved 
by all streams. Thus it has the same average SINR target 7 for each user as the other algorithms. For each 
channel realization, all algorithms first perform feasibility test using a large Pmax = 43 dBm. Feasibility 
test for the method in Q can be done similarly as the proposed algorithms. As soon as the feasibility test 
passes, the corresponding algorithm switches to the power minimization steps and runs until convergence 
but for at most 50 iterations. Feasibility test for BD can be done trivially. Only the cases where all the 
algorithms have passed the feasibility test, and converged within 50 iterations, are considered in averaging 
the performance. Again, we will defer the discussions for the infeasible cases and the convergence issues 
later. As shown in the figure, the proposed group power allocation performs better than the method in 
|[5l at high SINR. However, at low SINR it requires more power. This is because group power allocation 
suffers for the fact that it cannot adjust the power within a group as the method in Q. At low SINR, 
the interference is larger and the method in lO can adjust the power within a group to better deal with 
the interference. On the other hand, the proposed per stream power allocation performs better than the 
other algorithms at both high and low SINR. Similar to Fig. |2l the BD methods perform better than the 
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method in Q at high SINR, but are worse than the proposed methods in all cases presented. The results 
at extremely high power, where the performances of BD and the proposed methods are close, are not 
shown due to the same reason discussed before. 

We also present the sum rate comparison in Fig. |4] where the balanced levels of all users are the same 
(as in Fig. O as an indication of the QoS guaranteed and the fairness achieved. The simulation parameters 
are the same as in Fig. |2] Note that the sum rates of both BD methods are worse than the method in 
||5l, while their balanced levels cross over that of Q in Fig. |2l This is because under the same average 
SINR, the method in iQ will make all streams of a user have equal SINR and achieve the highest sum 
rate due to the concavity of the log function. Thus when a scheme's balanced level advantage over the 
method in ISl is not significant enough (e.g., the BD schemes), its sum rate may be lower than that of 
im. We emphasis again that our algorithms focus on the QoS (average SINR) of individual users. Our 
problem formulations are fundamentally different from those focusing on sum rate optimization and not 
guaranteeing the QoS. 

Now we show the feasibility and convergence properties of the proposed algorithms. In the above 
simulation setting, the number of transmit antennas M is equal to the total number of data streams of 
all users Xlfc^i ^fc (also equal to the total number of receive antennas J^fc^i ^k)- We further consider 
the cases where M < X^a^i by increasing the number of users K. That is, for Problem Pp, K = 3, 
M = 8, and Nk = Lk = 4,VA;; while for Problem Pr, K = 5, M = 8, Nk = Lk = 2,\fk. We name 
these cases as Case 2 and the settings for Fig. |2] and [3] as Case 1. Note that typically the system will 
perform scheduUng |[2ll when M < J2k=i ^k, that is, it uses time-division multiple access (TDMA) to 
schedule a number of users such that M = J2k=i ^k each time. Thus the simulation results of Case 1 
represent the performance of fully loaded systems and those of Case 2 well represent the performance 
of over-loaded systems. 

First we discuss the feasibility issues. From the simulations of Case 1, we observed that the proposed 
algorithms and the method in Q passed the feasibility test for almost all channel realizations. Intuitively, 
group power allocation is more feasible than the method in Q because the average, instead of per 
stream, SINR constraints are easier to be achieved, and they make Equation (|27] ) better conditioned than 
the corresponding equation in Q. Thus nonnegative solutions of (l27l ) are easier to be found. In addition, 
since the group power allocation is a special case of the per stream power allocation with equal power 
distribution among the streams of a user, the per stream power allocation method should be even more 
feasible. As an example, when a high target SINR (7 = 12 dB) is desired, simulation shows that the 
probabilities of feasibility for the group power allocation, the per stream power allocation, the method 
in 121, group BD and per stream BD are 99%, 100%, 67%, 100%, and 100%, respectively. For Case 
2, the system can only support lower target SINR and the probabilities of feasibility for the above five 
algorithms when 7 = 3 dB are 100%, 100%, 0%, 0%, and 0%, respectively. Note that for Case 2, the 
BD based methods can not be applied since M is not large enough. 
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As for the insights to the convergence behavior, typically each optimization step improves its objective 
function as outlined in Section III-DI The beamforming step maximizes each user's sum SINR of the 
data streams and the power allocation step optimizes the balanced level. As an example, in Fig. |5j we 
plot the balanced levels versus iteration times of the two proposed algorithms for Problem Pr under the 
same channel conditions. The total power constraint is set to 15 dBm, and the SINR constraint 7 = 1. 
The arrows point at the numbers of iterations where the algorithms meet the convergence criteria. From 
the figure, the two proposed algorithms typically do not oscillate often and exhibit smooth transient 
behaviors. We also observe that the convergence behavior of the group power allocation is slightly better 
than that of the per stream power allocation, i.e., the per stream power allocation is not as smooth as 
the group power allocation and needs more iterations to approach the balanced level. The reason why 
the per stream power allocation has worse convergence behavior is that after a power allocation step, the 
noise whitening property obtained by the previous maximum SINR filter bank may no longer be valid, 
that is, Yj^tl^^Yk may no longer be a scaled identity matrix. This effect may decrease the balanced 
level. However, in most cases this negative effect has a small impact on the eventual performance. Table 
IVIII lists the iteration times needed to converge for both problems. From these results, we can see that all 
three methods need more iterations to converge in Case 2. Note that for Problem Pp, the target SINRs 7 
for Case 2 are smaller than those of Case 1 since the method in |!51 is not feasible for 7 > 3 dB. Also, 
the method in Q needs significantly more iterations when 7 = 2 dB. 

Fig. |6] shows the probabilities of the proposed algorithms and the method in fSl converging within 50 
iterations given that they have passed the feasibility test, for the power minimization problem in Case 1 
(settings of Fig. |3]). We can see that the group power allocation and the method in O both exhibit good 
convergence probabilities while the per stream power allocation converges better at low SINR than at high 
SINR. The reason for the lower convergence probability of the per stream power allocation is that the 
linear programming makes the algorithm prone to oscillation between feasible solutions from iteration 
to iteration. In practice, as long as the solution is a nonnegative power vector, the SINR constraints 
are achieved, no matter the algorithm oscillates or not. Moreover, even when the per stream power 
allocation oscillates at the final iterations, typically the SINRs are still higher than that of the group 
power allocation. So one can simply pick the solution at the final iteration and still obtain a better 
performance. The other way is to avoid oscillation by switching to the group power allocation whenever 
the per stream power allocation algorithm oscillates. The performance of this combined algorithm should 
be between the performance of the per stream power allocation and the group power allocation. Fig. [T] 
shows the convergence probability for Problem Pp in Case 2. Since the method in ||5l is not feasible when 
SINR constraint for 7 > 3 dB, we only plot for 7 < 3 dB. The group power allocation still converges 
almost surely in this overloaded case. 
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VI. Computational Complexity 

In Table I VII we compare the computational complexity in one iteration based on the number of complex 
multiplications. From this table, one can see that there is no single step which dominates the complexity 
for each algorithm, so we list all of them for comparison. For each optimization step, the complexity of 
group power allocation (Table ^ or power minimization (Table HiH) is lower than that of the method in 
im, and we have shown in Section |V] that the performances of the proposed methods are also superior. 
The reason for the complexity saving of the group power allocation method is due to the fact that the 
method in lH processes the streams separately (matrix dimension L), while the group power allocation 
processes the streams of a user jointly (matrix dimension K, K < L). For the per stream algorithms 
(Table |V] and UVl ). the complexity of power allocation is at most slightly higher than that of the method 
in in, but the performance is much better. 

In addition to the computational complexity in one iteration, the average number of iterations needed for 
convergence also affects the system complexity. The average numbers of iterations for the three algorithms 
in the simulation settings of Fig. [3] and Fig. |2] are shown in Table lVIl1 (for the power minimization Problem 
Pp, the average number of iterations needed by the feasibility test is included). This table shows that 
the group power allocation method has the fastest convergence among the three algorithms, while the 
per stream power allocation has the slowest convergence. Compared to the method in Q, the group 
power allocation has a lower computational complexity, converges faster and performs better. If more 
complicated computation is allowed, the per stream power allocation exhibits even better performance. 

As for the BD algorithms used in this paper, the computation of the zero-forcing transmit beamformers 
has approximately the same complexity as that of the "uplink beamforming" step of Group (Pr) in Table 
rvit while the complexity for receiver beamformers is approximately the same as that of "downlink 
beamforming" step of Group (Pr). The complexity of the power allocation steps is negligible compared 
with those of the beamformers. Since the BD algorithms do not need iterations, they are not listed in the 
comparisons in Tables |Vl] and IVIII (nor in Fig. O. 

VII. Conclusion 

Efficient solutions to the joint transmit-receive beamforming and power allocation under average SINR 
constraints in the multi-user MIMO downlink systems were proposed. The beamforming filter is a GSINR- 
FB which exploits the intra-group cooperation of grouped data streams. Due to this selection, the SINR 
balancing structure of optimal power allocation holds and simplifies the computation. Based on the 
uplink-downlink duality, we formulated the dual problem in the virtual uplink, and iteratively solved the 
optimal beamforming filters and power allocation matrices. The proposed algorithms are generalizations 
of the one in lH to the scenario with multiple receive antennas per user, and exploit the receiver diversity 
more effectively than ISj. Simulation results demonstrated the superiority of the proposed algorithms over 
methods based on independent data stream processing IH and BD in terms of performance. Moreover, 
the computational complexities of the proposed methods are comparable with that of |[5l . 
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TABLE I 

Basic Steps of the tith Iteration 



1: First Downlink Power Allocation 

Fixed U'"' and V<"', find new p(2"+i) 
2: Downlink Receive Maximum SINK Beamforming 

Fixed p(2"+i) and U'"', find new V("+^) 
3: Second Downlink Power Allocation 

Fixed U'"' and V("+^', find new p(2"+2) 
4: First Virtual Uplink Power Allocation 

Fixed V("+'' and U'"', find new q(2"+i) 
5: Virtual Uplink Receive Maximum SINR Beamforming 

Fixed Q(2"+i) and find new u("+^' 

6: Second Virtual Uplink Power Allocation 

Fixed U("+1' and V("+i', find new q(2"+2). 
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TABLE II 

Iterative algorithm for Problem Pr with group power allocation 



Initialization: U = I, V = I 
Iteration: 

1 ; First Downlink Power Allocation with Sum Power Constraint 
Solve p in T 

2: Downlink Receive Maximum SINK Beamforming 
for fc = 1 : A" 

3: Second Downlink Power Allocation with Sum Power Constraint 
Solve p in T 

4: First Virtual Uplink Power Allocation with Sum Power Constraint 
Solve q in A 

5: Virtual Uplink Receive Maximum SINR Beamforming 
for fc = 1 : A" 

Ufc = ez5(RUi;,RUL) 

6: Second Virtual Uplink Power Allocation with Sum Power Constraint 
Solve q in A 

7: Repeat steps 1-6 until convergence, i.e.. |C°l("' - C'DL{n-i)| ^ ^ 
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TABLE III 

Iterative algorithm for Problem Pp with group power allocation 



Initialization: Feasibility test using the algorithm in Table HH , if failure then exit. 
Iteration: 

1 ; First Downlink Power Minimization 

p = (I D*)^Do- 
2: Downlink Receive Maximum SINR Beamforming 

3 : Second Downlink Power Minimization 

p = (I D*)^Do- 
4: First Virtual Uplink Power Minimization 

q= (I-D*^)^D(T 

5: Virtual Uplink Receive Maximum SINR Beamforming 

for fc = 1 : iC 

Ufe = eiff(RUL,RUL) 

6; Second Virtual Uplink Power Minimization 
q= (I-D*^)^D(T 

7: Repeat steps 1-6 until convergence, i.e., \C^^ — 1| < e 
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TABLE IV 

Iterative algorithm for Problem Pp with per stream power allocation 



Initialization: Feasibility test using the algorithm in Table Ivl , if failure then exit. 
Iteration: 

I: First Downlink Power Minimization 

Solve p in the linear programming problem: 
min l^p 

s.t. Pki[Akk]ii /^k - I] I] Pj;[Ajfc]a = ifeo- , 
1=1 j=i,j/fci=i 

for k — 1, K, and p >e 
2: Downlink Receive Maximum SINR Beamforming 
for k = l: K 

3: Second Downlink Power Minimization 

Solve p in the linear programming problem: 
min l^p 

Lk K Lj 

S.t. Pfci[Afcfc];;/7fc - Yl PjA'^jk]!! = Lk(T , 

1=1 j = l,j^kl = l 

for k — 1, K, and p >e 
4: First Virtual Uplink Power Minimization 

Solve q in the linear programming problem: 
min l^q 

S.t. qki[^kk\nhk - Yl £ 9ji[Bfcj]H = ifeo- , 
1=1 j=i.jj^ki=i 

for k = I, K, and q >e 

5: Virtual Uplink Receive Maximum SINR Beamforming 

for k = l: K 

U, =eip(RU^,RUL) 

6: Second Virtual Uplink Power Minimization 
Solve q in the linear programming problem: 
min l^q 

Lk K Lj 

S.t. Y iki[^kk\u/^k - Y gji[Bfej]ii = ifeo-^, 

1=1 j=l,j^kl=l 

for k — 1, K, and q >e 
7: Repeat steps 1-6 until convergence, i.e., \C^^ — 1| < e 
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TABLE V 

Iterative algorithm for Problem Pr with per stream power allocation 



Initialization: U = I, V = I 
Iteration: 

1: First Downlink Power Allocation with Sum Power Constraint 

2: Downlink Receive Maximum SINR Beamforming 
foi k^l: K 

3: Second Downlink Power Allocation with Sum Power Constraint 

+ ^ (2„ + l) ^ Wt, 

4: First Virtual Uplink Power Allocation with Sum Power Constraint 

5: Virtual Uplink Receive Maximum SINR Beamforming 
foi k^l: K 

6: Second Virtual Uplink Power Allocation with Sum Power Constraint 

(2n+2) _ , rr(2n+l) _ fafcfc/7fc 

— ... K ' wnere ii^ — — — (2„+i) 



T 1 2^ "jfc- 



Repeat steps 1-6 until convergence, i.e., |C°^'"^ - ^ ^ 



TABLE VI 

Complexities of the optimization steps in one iteration 



Uplink Uplink Power Downlink Downlink 

Beamforming Allocation Beamforming Power 

Allocation 



Group (Pr) 


0(7^M^) 


0{{K^lf) 


o(Ef= 




0{{K + lf) 


Group (Pp) 


0{KM^) 




o(Ef= 




0{K'^) 


Per Stream (Pr) 


0{KM''^) 




o(Ef= 


aLl) 




Per Stream (Pp) 


0{KM''^) 


Oil'"'') 


o(Ef= 




0(L3'5) 


Khachan's (Pr) 


0{LAr^) 


0{L') 


o(Ef= 




0{L') 


Khachan's (Pp) 


0{LM'^) 


0{L') 


o(Ef= 




0{L') 
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TABLE VII 

Average numbers of iterations needed for convergence 



Case 1 (Fully loaded) 
Problem Pp Problem Pr 



SINR constraint (7) 


2 


4 


6 Pmax(dB) 


10 


12 


14 


Group 


9.6500 


10.7200 


11.7200 


12.359 


12.608 


12.558 


Khachan's 


15.3720 


17.7600 


21.7200 


13.857 


13.316 


13.382 


Per Stream 


25.7861 


26.1097 


30.2598 


15.475 


14.871 


13.906 






Case 2 (Over loaded) 










Problem Pp 






Problem Pr 




SINR constraint (7) 


-2 





2 Pmax(dB) 


10 


12 


14 


Group 


10.41 


11.88 


14.96 


15.43 


17.37 


20.31 


Khachan's 


16.63 


22.97 


40.93 


14.11 


14.58 


15.72 


Per Stream 


25.78 


26.10 


30.25 


17.53 


18.3 


17.04 



DRAFT 



26 



Downlink 




Virtual Uplink 




Fig. 1. MIMO downlink system model for user k and its virtual uplink. 
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Fig. 2. Compaiison with jS) and BD for Problem Pr. K = 4, M = 8, Nk = 2, Vfe. 




Fig. 3. Comparison with and BD for Problem Pp. K = 2, M ^ 8, Nk = 4, Vfc. 
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Fig. 4. Sum rate comparison with (5| and BD for Problem Pr. K = 4, M ^ 8, Nk ^ 2, Vfc. 
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Convergence Behavior 
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Case 1, Per stream MSINR 
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Fig. 5. Convergence behaviors of the proposed algorithms for Problem Pr in Case 1 (fully loaded) and Case 2 (over loaded). 
The arrows point at the numbers of iterations where the algorithms meet the convergence criteria. 
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Power Minimization, K=2, IV1=8, N=[4 4] 



1 

0.9 
0.8 



t 0.7 

CO 

-8 0.6 

D. 

8 0.5 

(D 



0.4 



<3 0.3 
0.2 
0.1 




Group IVISINR 

Khachan's 

Per stream MSINR 



III 111 111 111 111 



-3-2-10123 
SINR constraint y (dB) 



Fig. 6. Comparison of the convergence probabilities for Problem Pp in Case 1 (fully loaded): K — 2, M = 8, Nk ~ 4, Vfc. 



Power minimization, K=3, M=8, N=[4 4 4] 



1 

0.9 
0.8 
^ 0.7 

CO 

^ 0.6 

Q. 



CD 

S3 0.4 
> 

o 

O 0.3 



0.2 
0.1 




Group IVISINR 

Khachan's 

Per stream MSINR 



-4 -3 -2 -1 
SINR constraint Y (dB) 



Fig. 7. Comparison of the convergence probabilities for Problem Pp in Case 2 (over loaded): K = 3, M = 8, Nk = 4, Vfc. 
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